Proc. Natl. Acad. Sci. USA 

Vol. 89, pp. 2027-2031, March 1992 

Biochemistry 

YMXM motifs of IRS-1 define substrate specificity of the insulin 
receptor kinase 

(signal transduction/phosphatidylinositol 3'-klnase / oncogene/ tyrosine kinase/nr homology domain 2) 

Steven E. Shoelson^, Swati Chatterjee, Manas Chaudhuri, and Morris F. White 

Research Division, Joslin Diabetes Center, Departments of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston. MA 02215 



Communicated by Josef Fried, December 6, 1991 

ABSTRACT Of 34 tyrosine residues in insulin receptor 
substrate 1 (IRS-1), 14 are adjacent to acidic residues, sug- 
gesting that they might be phosphorylation sites. Synthetic 
peptides corresponding to sequences surrounding these ty- 
rosines were used as substrates of the insulin receptor kinase. 
Surprisingly six of these, each within YMXM motifs, were 
phosphorylated with greatest efficiency (K m , 24-92 /*M; 
*c*t/#ni, 0.6-2.1 x 10 4 M _1 -sec _1 ). Substituted VMXM pep- 
tides revealed a strong preference of the insulin receptor kinase 
for methionine at Y + 1 and Y + 3 positions. When phosphor- 
ylated, related YMXM sequences are recognition motifs for 
binding to proteins with wc-homology (SH2) domains. The 
combined hydrophobic and flexible nature of methionine side 
chains adjacent to the targeted tyrosines provides a versatile 
contact for recognition by diverse proteins involved in signal 
transduction. 



Insulin binding to the extracellular a subunits of the insulin 
receptor activates tyrosine kinase activity intrinsic to the 
intracellular j8 subunits (1-4). This finding sparked a search 
for cellular substrates that could link the insulin receptor to 
postreceptor signaling events. The first identified endoge- 
nous substrate of the insulin receptor was ppl85, a cytoplas- 
mic phosphoprotein of 165-185 kDa (5). Recently, a cDNA 
corresponding to a component of the ppl85 band was se- 
quenced to provide the deduced primary structure of insulin 
receptor substrate 1 (IRS-1) (6, 7). As IRS-1 has little 
extended sequence homology with other known proteins, an 
intrinsic function for it has not been assigned. IRS-1 is 
phosphorylated on tyrosine residues after insulin stimulation 
(7), although the exact sites of phosphorylation have yet to be 
determined. Of a total of 34 tyrosines, 14 are preceded by 
acidic amino acids, suggesting that they might be targets of 
the insulin receptor kinase. Surprisingly, 6 of these are in 
YMXM motifs and two more are in YXXM motifs. 

The platelet-derived growth factor (PDGF) receptor and 
polyoma virus middle-sized tumor antigen (middle T antigen) 
contain similar YMXM and homologous YVXM motifs, 
which are phosphorylated and thought to be essential for 
binding to phosphatidylinositol 3'-kinase (PI 3-kinase) (8, 9). 
The 85-kDa regulatory subunit of PI 3-kinase contains two 
SH2 domains (10, 11), which are thought to mediate its 
interactions with the PDGF receptor, the pp6(F sr 7middle T 
antigen complex, and other proteins involved in cellular 
signaling (8). In fact, immunoprecipitation studies suggest 
that PI 3-kinase activity is closely associated with IRS-1 after 
insulin stimulation of intact cells (7, 12, 13). Thus, phosphor- 
ylated IRS-1 might act as a "docking protein" to bind and 
regulate PI 3-kinase and additional signal-transducing pro- 
teins containing SH2 domains as well. 
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In this study, we show that peptides corresponding to 
IRS-1 YMXM motifs are excellent substrates of the insulin 
receptor kinase. In fact, within a given sequence the presence 
of methionine residues at Y + 1 and Y + 3 positions is even 
more important than acidic residues N-terminal to tyrosine 
for catalytic efficiency. These findings suggest that in addi- 
tion to being recognition elements for interactions with SH2 
domain-containing proteins, residues within YMXM se- 
quences form a recognition motif for insulin receptor- 
catalyzed phosphorylation. These studies define structural 
requisites for efficient phosphorylation of a cellular substrate 
of the insulin receptor and may extrapolate directly to other 
members of the family of tyrosine kinases and their interac- 
tions with additional elements of signaling cascades. 

EXPERIMENTAL PROCEDURES 

Peptide Synthesis. Solid-phase syntheses were performed 
on an Applied Biosystems model 430A synthesizer using 
standard dicyclohexylcarbodiimide-rnediated preformed 
symmetrical anhydride coupling protocols. Amino acids with 
standard butoxycarbonyl/benzyl protecting groups were 
purchased from Applied Biosystems; additional solvents and 
reagents were the highest purity available. Peptide products 
were cleaved from the resin, side-chain protecting groups 
were removed, and methionine sulfoxide was reduced during 
"low-high"* cleavages with trifluoromethanesulfonic acid 
(14). Peptides were typically quite pure; those having ^95% 
purity by analytical reversed-phase HPLC were purified 
further by preparative HPLC (Waters Prep 4000) on a Dy- 
namax-300A 12-itm C8 column (41.4 x 250 mm) equipped 
with a matched guard column. Peptides were eluted with a 
mobile phase composed of acetonitrile in 0.05% aqueous 
trifluoroacetic acid (80 ml/min), Amino acid analyses of all 
peptides used for kinetic analyses were as predicted. 

Insulin Receptor Preparation. Chinese hamster ovary 
(CHO) cells, transfected with human insulin receptor con- 
structs and expressing 10 6 receptors per cell (15), were grown 
in suspension in a 10-liter spinner flask in modified McCoy's 
5A medium (GIBCO) containing 0.0345 mg of L-proline per 
ml and no CaCl 2 (16, 17). Cells were solubilized in 1.0% 
Triton X-100 and receptors were partially purified on wheat 
germ agglutinin (WGA)/agarose as described (17, 18). WGA- 
purified protein was stored at -70°C until needed. Identical 
aliquots of receptor were used for all assays. 

Substrate Phosphorylation Experiments. WGA-purified in- 
sulin receptor (600 /xl; 1.5 pmol) was incubated sequentially 
with 1 /xM insulin (1 h at 4°C) and 50 pM ATP/5 mM MnCl 2 
(1 h at 22°C) for maximal phosphorylation. The peptides were 
diluted appropriately in a mixture of 50 pM [y- 3 *P] ATP/5 mM 
MnCl 2 in 50 mM Hepes containing 0.1% Triton X-100. 
Substrate phosphorylation reactions were initiated by the 
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addition of 13 /xl (32 fmol) of activated (insulin stimulated and 
ATP phosphorylated) insulin receptor, allowed to proceed at 
22°C for 5 min (final vol, 40 /il), and terminated by addition 
of 65 /xl of 5% trichloroacetic acid. Incorporated phosphate 
was determined by a modification of the phosphocellulosc 
adsorption method (19). High molecular weight species were 
removed by precipitation after incubation in the presence of 
0.16% bovine serum albumin for 30 min at 4°C. Portions of 
each supernatant solution containing phosphorylated pep- 
tides were spotted onto 2-cm 2 pieces of P81 phosphocellulose 
paper (Whatman) and washed four times for 30 min each in 
1.0 liter of 0.075 M phosphoric acid. The papers were rinsed 
in acetone and allowed to air dry; incorporated phosphate 
was determined by Cerenkov counting. 

Kinetic Analyses. Values for 32 P incorporation into the 
peptide substrates (cpm) were converted to pmol/min rates 
(V) to facilitate direct comparisons between peptides; divid- 
ing each pmol/min value by 1.3 x 10~ 5 mg (the amount of 
WG A-purified protein) converts it to the corresponding pmol 
per min per mg of protein value. K m and V max values and the 
corresponding standard errors were determined with the 
assistance of the enzyme program (20), which follows algo- 
rithms for appropriate weighting of data described by Cleland 
(21); each peptide was analyzed in two to five separate 
experiments (see Table 2). The amount of insulin receptor in 
the WGA-purified protein isolated from human insulin re- 
ceptor construct-transfected CHO cells, estimated by 
[ 125 I]insulin binding and Scatchard analyses using the ligand 
program (22), was 25 nmol/mg (range, 10-40 nmol/mg), 
which corresponds to 1% of the lectin-bound protein. This 
value was used to determine turnover rates (/feat) and catalytic 
efficiency (fccat/^m) • 

RESULTS 

Peptide Design. Peptides corresponding to sequences of 
IRS-1 surrounding tyrosine residues were synthesized. All 
consensus sites for tyrosine phosphorylation within YMXM 
motifs were prepared, as were additional consensus se- 
quences not found in YMXM motifs (Table 1). Each peptide 
was designed to include (/) the specificity residue Y, (ii) 
acidic residues N -terminal to Y, (Hi) residues at Y + 1 to Y 
+ 5 positions, including the entire YMXMSP sequence, when 
appropriate, and (iv) as many native basic residues as pos- 
sible, up to a total of three. If sufficient basic residues were 
not present within the native sequence, lysine residues were 
added at the C or N termini to a total of three to guarantee 
adsorption to the phosphocellulose paper used in the phos- 
phopeptide assay (23). Therefore, peptides varied in length 
(10-18 residues) and position of the tyrosine residue. 

Each peptide contains only one tyrosine with the exception 
of Y46,* which contains two side-by-side tyrosines flanked 
by glutamic acid residues (Table 1). Peptides Y460 and Y546 
contain YXXM motifs in which the Y + 1 position is 
isoleucine or threonine, respectively, with acidic residues at 
appropriate positions relative to Y (Table 1). Peptides Y608, 
Y628, Y658, Y727, Y939, and Y987 each contain complete 
YMXM motifs with at least one acidic residue N-terminal to 

Y (Table 1). A final peptide, Y998, contains a single tyrosine 
residue that is neither in a YMXM motif nor adjacent to an 
N-terminal acidic residue; like four of the YMXM peptides, 
however, it does contain a serine and proline at the Y + 4 and 

Y + 5 positions (Table 1). 

♦Peptides are named according to the position of tyrosine in the 
sequence of IRS-1 (e.g., a peptide corresponding to the sequence 
surrounding Y 987 is called Y987); an analogue of Y 987 in which 
methionine at the +1 position is mutated to threonine is called 
Y987(M988T). Peptide sequences are listed in Table 1. 



Table 1. Synthetic peptides corresponding to native and 
modified sequences of IRS-1 



Position/name 



Sequence 



Double tyrosine 

Y46 
YXXM motifs 

Y460 

Y546 
YMXM motifs 

Y608 

Y628 

Y658 

Y727 

Y939 

Y987 

Nonspecific sequence 
Y998 

Modified Y987 sequences 
Y987(D986N) 
Y987(M988I) 
Y987(M988T) 
Y987(M988Nle) 
Y987(M990T) 



RLEYIENEKK 

/TRGEEELSNXX£1GGK 
ff/CVSIEEYTElMPA/T 

K tfHTD D GXMfiS P G V A 
RKGNGDGXMBSPKSV 
ffff R V D P N GUSLIS P S G S 
OTfLPATGDXMMSPVGD 
/TKGSEEIfiMDLGPGR 
iCKSRGDXflUQIG 

KPRNSXVDTS£VAP JC 

/fKSRGfflimiQIG 
ffffSRGDXHUQIG 
tfffSRGDXnJBLQIG 
/UfSRGDXHflQIG 
JTKSRGDX&12QIG 



Phosphorylated tyrosines are boldface and underlined, as are 
additional elements of the YMXM specificity motif. Non-native 
residues are in italics; basic residues were added for assay purposes 
and there is a cysteine to alanine substitution for peptide Y727. For 
modified Y987 sequences, substituted positions are italicized. 
T Norleucine. 

Additional analogues of Y987 (a low A: m substrate with a 
complete YMXM motif) were prepared to test the functional 
importance of residues surrounding the targeted tyrosine 
(Table 1). To determine directly the effect of isoleucine or 
threonine in place of methionine at the Y + 1 position, as 
occurs naturally in sequences surrounding Y460 and Y546, 
these residues were substituted into the sequence of Y987. In 
addition, norleucine, which is isomorphic with methionine, 
was substituted at the Y + 1 position to further test require- 
ments for side-chain hydrophobicity and flexibility. The 
function of methionine at the Y + 3 position of Y987 was also 
tested by analogous substitution to threonine, and the re- 
quirement for an acidic residue (aspartic acid at Y — 1) was 
tested by a conservative substitution to asparagine. 

Peptide Phosphorylations and Kinetic Analyses. All peptides 
(Table 1) were phosphorylated by the insulin-stimulated and 
autoactivated receptor kinase in a time- and temperature- 
dependent fashion and displayed saturable kinetics (Fig. 1 A 
and C), which yielded linear Lineweaver-Burk plots (Fig. 1 
B and D). In every case, phosphorylation reactions were 
linear for at least 5 min (data not shown), validating condi- 
tions used for determinations of K m and V max values (Table 
2). Rates of peptide phosphorylation (V max ) and peptide 
concentrations required for half-maximal saturation (K m ) 
varied dramatically between peptides (Fig. 1). The slopes of 
double reciprocal plots, which are proportional to Km/V^ 
(the inverse of catalytic efficiency), clustered into two groups 
(Fig. 1 B and Z>). Notably, native sequences having entire 
YMXM motifs displayed the smallest slopes (Fig. ID) and 
were therefore phosphorylated most efficiently. By contrast, 
YXXM peptides and other sequences not containing com- 
plete YMXM motifs all displayed steeper slopes (Fig. IB), 
demonstrating that they were phosphorylated less efficiently. 

Calculation of kinetic constants revealed that K m values for 
YMXM peptides ranged from 24 to 92 pM, lower than values 
previously reported for any peptides used as exogenous 
substrates of the insulin receptor (Table 2). Values for 
which range from 0.9 to 1.7 pmol/min (69-131 nmol* 
min _1 *mg~ l ), are difficult to compare to previous studies as 
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Fig. 1. Phosphorylation of peptides corresponding to native 
IRS-1 sequences by the insulin receptor kinase. Representative V vs. 
substrate concentration (A and C) and double reciprocal plots (B and 
D) are shown for phosphorylation of non-YMXM sequences (A and 
B) and YMXM sequences (C and D). Peptide sequences are identified 
in Table 1; methods used for phosphorylation reactions and data 
handling are described in Experimental Procedures. 

this number varies with kinase specific activity. Values for 
*cat/^m, determined in this study, range from 0.6 to 2.1 x 10 4 
M _1 'S -1 . Therefore, in terms of substrate binding (estimated 
by ^m)» turnover rates (/cc a t)» and overall catalytic efficiency 
(*cat/^m)> all of the YMXM peptide sequences are excellent 
substrates of the insulin receptor kinase. 

Similar analyses were performed with YXXM motifs (pep- 
tides Y460 and Y546; Table 1). Interestingly, K m for peptide 
Y460 phosphorylation (isoleucine at the Y + 1 position) was 
73 ^M, within the range of values for YMXM peptides; this 
peptide was a less efficient substrate because V^* was 0.30 
pmol/min, well below the range for YMXM peptides, which 
suggests that side-chain flexibility in addition to hydropho- 

Table 2. Summary of kinetic constants 











^catt 




Peptide 


Km 




pmol/min 


S" 1 






Double tyrosine 














Y46 


140 


±85 


0.3 ± 0.06 


0.14 


1.0 x 


10 3 


YXXM motifs 














Y460 


73 


± 15 


0.3 ± 0.03 


0.16 


2.1 x 


10 3 


Y546 


300 


± 30 


1.4 ± 0.07 


0.73 


2.4 x 


10 3 


YMXM motifs 














Y608 


90 


± 20 


1.1 ± 0.09 


0.57 


6.2 x 


10 3 


Y628 


60 


± 8 


1.2 ± 0.06 


0.63 


1.0 x 


10 4 


Y658 


86 


± 16 


1.6 ± 0.12 


0.83 


9.7 x 


10 3 


Y727 


24 


± 2.9 


0.9 ± 0.06 


0.47 


2.0 x 


10 4 


Y939 


61 


± 29 


1.7 ±0.3 


0.89 


1.5 x 


10 4 


Y987 


34 


± 8.8 


1.4 ± 0.11 


0.73 


2.1 x 


10 4 


Nonspecific 














sequence 














Y998 


3200 


± 600 


0.4 ± 0.04 


0.21 


6.5 x 


10 


Modified Y987 














sequences 














Y987(D986N) 


250 


± 69 


3.9 ± 0.6 


2.0 


8.1 x 


10 3 


Y987(M988I) 


80 


± 15 


0.9 ± 0.07 


0.47 


5.9 x 


10 3 


Y987(M988T) 


250 


± 80 


2.7 ± 0.44 


1.4 


5.6 x 


10 3 


Y987(M988Nle) 


24 


±7.1 


1.3 ± 0.12 


0.68 


2.8 x 


10 4 


Y987(M990T) 


370 


± 110 


1.2 ± 0.13 


0.63 


1.7 x 


10 3 



bicity at the Y + 1 position might be necessary for high 
turnover and catalytic efficiency. By contrast, the reduced 
catalytic efficiency of peptide Y546 (threonine at the Y + 1 
position) was due to a high K m for phosphorylation, which at 
300 /iM was substantially greater than corresponding values 
for YMXM peptides; in this case V ma)l , at 1.4 pmol/min, was 
similar to the YMXM peptides. Within a limited context, 
these results begin to suggest that the unique character of 
methionine (being flexible and hydrophobic) performs a 
special function in enhanced catalytic efficiency. 

To test this more directly, the Y + 1 position within 
YMXM peptide Y987 was substituted with isoleucine, thre- 
onine, and norleucine (Table 1). In fact, both M988I and 
M988T substitutions reduced catalytic efficiency {k^jK^ 
nearly 4-fold (Fig. 2B> compare slopes); the reasons for 
reduced efficiency were different for the two peptides, how- 
ever. The M988I substitution increased K m 2- to 3-fold and 
reduced V max nearly 2-fold (Table 2). By contrast, the M988T 
substitution increased both K m (7- to 8-fold) and (2-fold). 
Thus, in a single, defined YMXM sequence M988I and 
M988T substitutions have the same general effect as ob- 
served for the YXXM peptides (Y460 and Y546). Substitution 
of methionine with norleucine, whose side chain mimics that 
of methionine regarding both hydrophobicity and flexibility, 
had no observable effect on K my V max , and Acat/^m (Fig- 2B; 
Table 2). 

For further comparison, we substituted the Y + 3 methi- 
onine of peptide Y987 and the acidic residue N-terminal to 
tyrosine (Y - 1). Surprisingly, the M991T substitution had an 
effect even greater than the related Y + 1 substitutions (Fig. 
2D), with a 12-fold reduction in catalytic efficiency (k^/K^ 
resulting exclusively from an increase in K m (Table 2). 
Therefore, methionine residues at Y + 1 and Y + 3 positions 
play a very special role in directing efficient catalysis by the 
insulin receptor kinase. Acidic residues, which exist near 
autophosphorylated tyrosines in the insulin receptor and 
other protein tyrosine kinases, are presumed to be important 
for intermolecular substrate recognition as well (23-27). The 
only acidic residue in the Y987 sequence was substituted with 
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Fig. 2. Insulin receptor-catalyzed phosphorylation of substi- 
tuted peptides corresponding to the IRS-1 sequence surrounding 
Y987. Representative V vs. substrate concentration (A and C) and 
double reciprocal plots (B and D) are shown for phosphorylation of 
peptides substituted at the Y + 1 position (A and B) or Y - 1 and Y 
+ 3 positions (Cand D). Peptide sequences are identified in Table 1; 
methods used for phosphorylation reactions and data handling are 
described in Experimental Procedures. 
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asparagine to assess the isolated effect of having no negative 
charge N-terminal to tyrosine (Fig. 2D). K m was increased 
8-fold while 

Vmax was increased nearly 3-fold. These changes 
caused a net 2.5-fold reduction in catalytic efficiency (k^/ 
K m ). Surprisingly, this is less deleterious than the corre- 
sponding effects observed for substituting methionine resi- 
dues at either Y + 1 or Y + 3 positions (Table 2). 

Two additional peptides corresponding to IRS-1 sequences 
not associated with YXXM or YMXM motifs were studied 
for comparison. Y46 contains a paired YY sequence flanked 
by acidic residues (EYYENE), resembling to a degree the 
major phosphorylation site of the insulin receptor itself 
(DIYETDYY) (28-30). The K m for peptide Y46 phosphoryl- 
ation was 134 /iM, higher than that observed for YMXM 
peptides (Table 2), and V max was 3- to 4-fold lower. There- 
fore, although Y46 can be considered to be a good substrate 
of the insulin receptor, it was phosphorylated with lower 
efficiency (6- to 36-fold) than YMXM peptides. Peptide Y998, 
which contains serine and proline at the Y + 4 and Y + 5 
positions but lacks an acidic residue and methionine alto- 
gether, has a K m for phosphorylation that is markedly ele- 
vated (3.2 mM). In addition, V max is reduced (0.4 pmol/min) 
so k caX /K m is dramatically reduced (65 M" 1 ^" 1 ), which dem- 
onstrates the importance of primary sequence in directing 
insulin receptor kinase action and suggests that the effects of 
acidic residues N-terminal to tyrosine and methionine resi- 
dues at Y + 1 and Y + 3 positions are cumulative. 

DISCUSSION 

Unlike serine/threonine-protein kinases, whose substrate 
recognition sequences are well defined by basic residues 
neighboring the phosphate acceptor (31-33), recognition se- 
quences for substrates of protein tyrosine kinases are not well 
characterized. Autophosphorylated tyrosines within protein 
tyrosine kinase sequences are frequently located near acidic 
residues, which has led most investigators to focus on these 
positions as consensus sequences for tyrosine phosphoryla- 
tion (23-27, 34-37). Synthetic peptides corresponding to 
autophosphorylation sites are phosphorylated by tyrosine 
kinases with A; m values typically in the millimolar range, 
which is considerably higher than values obtained for peptide 
substrates of serine/threonine kinases (23-27, 31-37). Unre- 
lated peptides such as angiotensin and gastrin analogues are 
also phosphorylated by protein tyrosine kinases. Angioten- 
sins I and II (both containing the DRVY*IHPF sequence) are 
phosphorylated with K m values in the 1-4 mM range (26, 34), 
while gastrin analogues having five sequential glutamates 
N-terminal to tyrosine are phosphorylated with K m values in 
the 50-200 /aM range (35, 36). In fact, of all synthetic peptides 
studied previously as tyrosine kinase substrates, gastrin had 
the lowest AT m , a fact supporting the notion that acidic 
residues are important for tyrosine phosphorylation. How- 
ever, systematic studies to define substrate specificity 
showed that when acidic residues of various peptides were 
replaced by uncharged amino acids, effects on Vmzx/Km were 
often relatively small (2- to 4-fold) (23-27, 34-37). 

The insulin receptor has tyrosine kinase activity similar to 
that of the viral transforming proteins, their cellular coun- 
terparts, and growth factor receptors. The insulin receptor 
phosphorylates the src peptide and angiotensin with K m 
values in the millimolar range (26) and, in addition, phos- 
phorylates a peptide corresponding to its own major auto- 
phosphorylation site (residues 1154-1164) (28-30) with a K m 
value of 0.2-0.3 mM (18, 23, 30, 37). While the sequence at 
residues 1154-1164 might be a preferred substrate sequence, 
it was the recent identification of an endogenous substrate 
that allowed us to test sequences that might direct intermo- 
lecular substrate recognition. Finding biologically important 
sites of substrate phosphorylation will facilitate testing the 
importance of elements of secondary or tertiary structure in 



addition to that of primary sequence. For the insulin recep- 
tor, IRS-1 may be such a substrate. IRS-1 is a component of 
ppl85 that is immunoprecipitated from insulin-stimulated 
cells by anti-phosphotyrosine antibodies. IRS-1 is phosphor- 
ylated on tyrosine residues in response to insulin (5-7) and 
appears to link the receptor to other components of the 
intracellular signaling cascade (e.g., PI 3-kinase; ref. 7). 

As yet, we do not know which of the 34 tyrosine residues 
in IRS-1 are actually phosphorylated, although 14 have acidic 
residues N-terminal to the tyrosine position, which would 
categorize them as conventional consensus sequences for 
tyrosine kinase recognition. Surprisingly, 6 of these are 
within YMXM motifs and 2 more are in YXXM sequences, 
which previous studies (8-10, 38) suggest are important for 
interactions with PI 3-kinase and other proteins having SH2 
domains. In this report, we show that synthetic peptides 
corresponding to IRS-1 YMXM sequences are excellent 
substrates for the insulin receptor kinase. In fact, calculated 
values of K m are lower than has been reported previously for 
peptides phosphorylated by any tyrosine kinase, with the 
exception of epidermal growth factor receptor-catalyzed 
phosphorylation of gastrin (which contains the homologous 
EEEEEAYGWM sequence; refs. 35 and 36). 

Amino acid substitution studies show that different posi- 
tions within the YMXM motif effect catalytic efficiency. 
Substitutions of methionine at Y + 1 decrease catalytic 
efficiency for tyrosine phosphorylation ^4-fold, estimated by 
(0 comparisons of k^JK m for peptides Y460 and Y546 
(YXXM motifs) and the six YMXM peptides, and (//) direct 
comparisons between Jtcat/^m for the wild-type Y987 se- 
quence and substituted peptides Y987(M988I) and 
Y987(M988T). Notably, norleucine and methionine at the Y + 
1 position are functionally indistinguishable. Substitution of 
methionine at Y + 3 with threonine has an even greater effect 
on peptide Y987 phosphorylation, in this case reducing 
kcax/Km ~12-fold due to an effect on K m . Both of these effects 
are greater than that produced by substituting aspartic acid 
with asparagine at Y - 1 of peptide Y987 («2.5-fold reduction 
in ken/Km). Sequences having neither N-terminal acidic res- 
idues nor methionine at Y + 1 and Y + 3 positions are 
phosphorylated much less efficiently. We do not know the 
significance of residues at the Y + 2 position (proline, methi- 
onine, asparagine, or threonine in the YMXM sequences 
studied here). Four of the six YMXM sequences of IRS-1 also 
have serine and proline at the Y + 4 and Y + 5 positions, 
although peptides with or without the YMXMSP residues are 
phosphorylated with equal efficiency. Furthermore, the Y998 
sequence contains serine and proline at the Y + 4 and Y + 5 
positions but without an acidic residue or methionine phos- 
phorylation was inefficient. The function (if any) served by 
these residues appears not to be related to phosphorylation. 

What, if anything, is special about Y + 1 and Y + 3 
positions, and why does methionine at these positions enhance 
kinetic efficiency? Sequences directly contiguous to and in- 
cluding YMXM are predicted by the Chou-Fasman algorithm 

(41) to be unstructured, although 0-turns are predicted to flank 
each of the YMXM motifs of IRS-1 (data not shown). Alter- 
natively, it has been proposed that substrates might adopt 
amphiphilic helical structures at the kinase catalytic surface 

(42) ; if so, methionine residues at Y + 1 and Y + 3 positions 
might be positioned appropriately for direct participation in 
binding. While we have not yet been able to analyze structure, 
the unique character of methionine may play a special role 
here. Isoleucine, leucine, and valine, while also highly hydro- 
phobic, have branched and thus relatively rigid side chains. By 
contrast, the side chain of methionine is unbranched, provid- 
ing considerable structural flexibility. In support of this, 
norleucine, which is equally flexible and hydrophobic com- 
pared to methionine, has an identical capacity to direct insulin 
receptor kinase action. This feature of methionine has recently 
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been proposed to provide a malleable nonpolar surface for 
signal peptide recognition (43) and plasticity in protein inter- 
actions with calmodulin (39), which would allow structurally 
diverse nonpolar surfaces to conform to the respective rec- 
ognition surfaces of the proteins to which they bind (40). Such 
a malleable, nonpolar surface might provide a mechanism to 
explain how the same IRS-1 YMXM motifs interact with the 
insulin receptor kinase and PI 3-kinase and, potentially, other 
tyrosine kinases and proteins with SH2 domains as well. The 
side chains of residues in the YMXM domain might be 
sufficiently flexible to provide a nonpolar surface that accom- 
modates itself equally to the surfaces of kinase active sites and 
SH2 protein binding pockets. 

Relationships between IRS-1 and other signal transduction 
proteins may be quite complex, as IRS-1 contains multiple 
copies of the YMXM motif that could interact selectively with 
different tyrosine kinases as well as distinct effector molecules 
containing various isoforms of the SH2/SH3 domain. Fur- 
thermore, other proteins involved in signal transduction con- 
tain related sequences. The PDGF receptor phosphorylates 
itself within YMXM or YVXM motifs (44), the src-associated 
middle T antigen contains a phosphorylated EEEEEEYMPM 
sequence (8, 45), and similar sequences within the colony- 
stimulating factor 1 receptor (DTYVEM) and kit oncogene 
(DSTNEYMDM) may be phosphorylated, as well (8). 
Whether YMXM motifs define a recognition sequence for 
efficient tyrosine phosphorylation by these and other protein 
tyrosine kinases remains to be tested. For the src-associated 
middle T antigen and the PDGF receptor, these sequences are 
thought to be involved in recognition by PI 3-kinase through 
interaction with one or the other of its SH2 domains (8, 9, 38, 
44). Similarly, studies to block interactions between IRS-1 and 
PI 3-kinase with synthetic phosphopeptides corresponding to 
IRS-1 YMXM sequences suggest that these interactions in- 
volve SH2 domains as well (unpublished data). 

In conclusion, YMXM, a redundant sequence motif iden- 
tified in IRS-1, is shown to define substrate specificity of the 
insulin receptor kinase. Methionine residues at the Y + 1 and 
Y + 3 positions are particularly important for efficient kinase 
recognition, in addition to acidic residues N-terminal to 
tyrosine. Knowing a target motif for the action of the insulin 
receptor kinases, and possibly other tyrosine kinases as well, 
may enhance our understanding of tyrosine kinase signaling 
pathways. Furthermore, a tyrosine kinase target motif may 
facilitate identification of additional known or newly identi- 
fied proteins that might be involved in related signaling 
pathways. Certainly, as genomic sequencing efforts pro- 
gress, it will be increasingly useful to have identified short 
peptide motifs to help decipher which of these DNA se- 
quences encodes proteins acting as potential targets of tyro- 
sine kinase action. 
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